REMARKS 



The Office Action dated March 21, 2006 ("Office Action") has been received and noted. 
Claims 1-16 and 30 were examined. Claims 1-16 and 30 were rejected. Claims 2, 5-16 are 
withdrawn. Claims 1, 3-5 and 30 are amended. Support for the amendments can be found in, for 
example, pages 18 and 19 of the Application. As such, no new matter has been added. Applicants 
respectfully request reconsideration of the claims in view of the above-amendments and the 
following remarks. 

Applicants would like to express their appreciation to the Examiner for promptly 
responding to inquiries made by Applicants with respect to the Office Action. 

The Examiner states that the Applicants cannot rely upon the foreign priority papers to 
overcome rejections because the translation of the foreign priority documents has not been made 
of record in accordance with 37 C.F.R. 1.55. Applicants herein enclose a certified copy of an 
English language translation of Korean Application No. 10-2002-0003184. 

The Examiner rejects the incorporation by reference of GenBank U80456 because protein 
and nucleic acid sequences that are an essential part of the invention must be disclosed by 
sequence. In response, Applicants state that Mr. Hanguan Liu of NCBI was contacted and stated 
that U80456 was first released into the GenBank public database on May 2, 1997 and was 
updated once on July 8, 1997. Following a comparison of the May 2, 1997 entry and the July 8, 
1997 entry, Applicants found that there is no difference between the two sequences. Absent a 
specific inquiry regarding date of conception, the earliest priority date is January 19, 2002 based 
on Korean Application No. 10-2002-0003184. Thus, Applicants respectfully submit that 
GenBank U80456 can be incorporated be referenced. (See email correspondence from Jason Lee 
to Steve De Klerk dated September 13, 2006 and accompanying attachments, attached hereto as 
Exhibit A). Accordingly, Applicants respectfully request withdrawal of this rejection. 

The specification was objected to for the misspelling of "alanine" and "arginine" on page 
18, line 13. Appropriate correction has been made. 
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Claims 1-16 and 30 were objected to due to various informalities. Claims 2 and 6-16 have 
been withdrawn. Appropriate correction has been made to remaining claims 1, 3-5 and 30. 

I. Claims Rejected Under 35 U.S.C. § 101 

Claims 1-4 and 30 were rejected under 35 U.S.C. § 101 as being directed to non-statutory 
subject matter according to the Examiner. Claim 2 is withdrawn. Amended claim 1 reads "[a] 
peptide comprising SEQ. ID NO.:l derived from human transcription factor SIM2 beginning at 
the 558th marker of human transcription factor SIM2 and ending at the 566th marker of human 
transcription factor SIM2, wherein the peptide is capable of transducing a biologically active, 
functional or/and regulatory molecule into prokaryotic cells or eukaryotic cells." Amended claim 
30 reads "[a] method of transducing a peptide into a prokaryotic or eukaryotic cell comprising 
preparing a peptide construct comprising SEQ. ID N0..1 derived from human transcription factor 
SIM2 beginning at the 558th marker of human transcription factor SIM2 and ending at the 566th 
marker of human transcription factor SIM2, wherein the peptide includes a biologically active, 
functional or/and regulatory molecule; and delivering the peptide construct in vivo to a subject 
through administration routes comprising intramascular, intraperitoneal, intravein, oral, nasal, 
subcutaneous, intradermal, mucosal and inhalation routes." The term "derived" recites a 
limitation of the "hand in man" indicating that the claimed peptide is isolated and/or purified. 
Dependent claims 3-4 depend on independent claim 1 and therefore include all of its limitations. 
Applicants respectfully submit that the amendments overcome the Examiner's rejection. 

II. Claims Rejected Under 35 U.S.C. § 112, first paragraph 

Claims 1-16 and 30 were rejected under 35 U.S.C. § 112, first paragraph for failing to meet 
the enablement requirement according to the Examiner. Amended claim 1 reads "[a] peptide 
comprising SEQ. ID NO.:l derived from human transcription factor SIM2 beginning at the 558th 
marker of human transcription factor SIM2 and ending at the 566th marker of human 
transcription factor SIM2, wherein the peptide is capable of transducing a biologically active, 
functional or/and regulatory molecule into prokaryotic cells or eukaryotic cells." Amended claim 
30 reads "[a] method of transducing a peptide into a prokaryotic or eukaryotic cell comprising 
preparing a peptide construct comprising SEQ. ID NO.:l derived from human transcription factor 
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SIM2 beginning at the 558th marker of human transcription factor SIM2 and ending at the 566th 
marker of human transcription factor SIM2, wherein the peptide includes a biologically active, 
functional or/and regulatory molecule; and delivering the peptide construct in vivo to a subject 
through administration routes comprising intramascular, intraperitoneal, intravein, oral, nasal, 
subcutaneous, intradermal, mucosal and inhalation routes." The Examiner is on record for stating 
that the specification is enabling for a peptide comprising the amino acid sequence of SEQ. ID 
NO.: 1, but has stated that the specification is not enabling for claims which include the term 
"active fragments". (Office Action, p. 5, 6) Claims 1, 3-4 and 30 no longer contains the term 
"active fragments". Amended claim 5 reads "[a] recombinant expression vector comprising a 
DNA sequence encoding a peptide comprising SEQ. ID NO.:l derived from human transcription 
factor SIM2 beginning at the 558th marker of human transcription factor SIM2 and ending at the 
566th marker of human transcription factor SIM2." The Examiner is on record for stating that the 
specification is enabling for a recombinant expression vector consisting of SEQ. ID NO.:l, but 
has stated that the specification is not enabling for claims for other limitations in claims 5, 6, 9, 
10-11, 14-16. (Office Action, p. 8) Those limitations and/or claims of which the Examiner rejects 
have been canceled and/or withdrawn. Accordingly, Applicants respectfully submit that claims 1, 
3-5 and 30 satisfy the enablement requirement of 35 U.S.C. § 112, first paragraph. 

HI. Claims Rejected Under 35 U.S.C. § 112, first paragraph 

Claims 1-16 and 30 were rejected under 35 U.S.C. § 112, first paragraph for failing to 
meet the written description requirement according to the Examiner. Amended claim 1 is drawn 
to a single species, i.e., "[a] peptide comprising SEQ. ID NO.:l derived from human 
transcription factor SIM2 beginning at the 558th marker of human transcription factor SIM2 and 
ending at the 566th marker of human transcription factor SIM2." Amended claim 30 is drawn to 
a single species, i.e., "[a] method of transducing a peptide into a prokaryotic or eukaryotic cell 
comprising preparing a peptide construct comprising SEQ. ID NO.:l derived from human 
transcription factor SIM2 beginning at the 558th marker of human transcription factor SIM2 and 
ending at the 566th marker of human transcription factor SIM2." Claims 1, 3-4 and 30 no longer 
contains the term "active fragments," of which the Examiner views as being directed to 
additional species. In addition, amended claim 5 is drawn to a single species, i.e., "[a] 
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recombinant expression vector comprising SEQ. ID NO.:l derived from human transcription 
factor SIM2 beginning at the 558th marker of human transcription factor SIM2 and ending at the 
566th marker of human transcription factor SIM2." Limitations and/or claims of which the 
Examiner rejects as being directed to additional species in claims 5, 6, 9, 10-11, 14-16 have been 
withdrawn and/or canceled. Accordingly, Applicants respectfully submit that claims 1, 3-5 and 30 
satisfy the written description requirement of 35 U.S.C. § 112, first paragraph. 

IV. Claims Rejected Under 35 U.S.C. § 112. second paragraph 

Claims 1-16 and 30 were rejected under 35 U.S.C. § 112, second paragraph for being 
indefinite according to the Examiner. Amended claims 1 and 30 claim a specific peptide derived 
from SEQ. ID NO.:l comprising an amino acid sequence beginning at the 558th marker of SEQ. 
ID NO.:l and ending at the 566th marker of SEQ. ID NO.:l. Dependent claims 3-4 depend on 
independent claim 1 and therefore include all of its limitations. Amended claim 5 claim a specific 
recombinant expression vector comprising a DNA sequence encoding a peptide derived from 
SEQ. ID NO.:l comprising an amino acid sequence beginning at the 558th marker of SEQ. ID 
NO.:l and ending at the 566th marker of SEQ. ID NO.:l. The remaining claims have been 
withdrawn and/or canceled. Accordingly, Applicants respectfully submit that claims 1, 3-5, and 
30 are definite under 35 U.S.C. § 112, second paragraph. 

Claim 13 was rejected under 35 U.S.C. § 112, second paragraph as being indefinite. 
Claim 13 is withdrawn. 

Claims 12-14 and 16 were rejected under 35 U.S.C. § 112, second paragraph as being 
indefinite. Claims 12-14 and 16 are withdrawn. 

Claims 4, 8 and 30 were rejected under 35 U.S.C. § 112, second paragraph as omitting 
essential steps according to the Examiner. Amended claim 4 reads "[t]he peptide of claim 1, 
wherein the peptide is transduced into the cells of prokaryotes or eukaryotes and administered in 
vivo through administration routes comprising intramuscular, intraperitoneal, intravein, oral, 
nasal, subcutaneous, intradermal, mucosal and inhaling routes." Amended claim 30 reads "[a] 
method of transducing a peptide into a prokaryotic or eukaryotic cell comprising: preparing a 
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peptide construct comprising SEQ. ID NO.:l derived from human transcription factor SIM2 
beginning at the 558th marker of human transcription factor SIM2 and ending at the 566th 
marker of human transcription factor SIM2, wherein the peptide includes a biologically active, 
functional or/and regulatory molecule; and delivering the peptide construct in vivo to a subject 
through administration routes comprising intramascular, intraperitoneal, intravein, oral, nasal, 
subcutaneous, intradermal, mucosal and inhalation routes." Thus, it is clear from the claim 
language that the peptide or peptide construct is delivered via standard delivery routes in vivo and 
tranduces a cell once it reaches the cell. Claim 8 is withdrawn. Applicants respectfully submit 
that the amendments in claims 4 and 30 overcome the Examiner's rejection. 

V. Claims Rejected Under 35 U.S.C. § 102 

A. 

Claims 1-3 were rejected under 35 U.S.C. § 102(a) as being anticipated by International 
Publication No. WO 01/57277 A2 to Penn et al. ("Penn"). A claim is anticipated only if each and 
every element as set forth in the claim is found, either expressly or inherently described, in a 
single prior art reference. MPEP 2131. Applicants respectfully submit that each and every 
element in amended claim 1 and its respective dependent claim is not set forth in the cited 
references. 

Amended claim 1 reads "[a] peptide comprising SEQ. ID NO.:l derived from human 
transcription factor SIM2 beginning at the 558th marker of human transcription factor SIM2 and 
ending at the 566th marker of human transcription factor SIM2, wherein the peptide is capable of 
transducing a biologically active, functional or/and regulatory molecule into prokaryotic cells or 
eukaryotic cells." The nine amino acid residue from the 558th marker of SEQ. ID NO.:l to the 
566th marker of SEQ. ID NO.:l consists of the sequence AKAARQAAR. By contrast, Penn 
describes genome-derived single exon microarrays useful for verifying the expression of regions 
of genomic DNA predicted to encode protein. (Penn, Field of Invention) SEQ. ID NO. :3 1606 of 
Penn includes a 96 amino acid residue which includes the sequence ARAARQAAR, or, more 
particularly, wherein a lysine is conservatively substituted for arginine when compared to the 
sequence of the claim 1. (Result 6 of Examiner's Search Strategy (.rag)) Thus, the cited reference 
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does not disclose each and every element of independent claim 1. Dependent claim 3 depends 
from independent claim 1 and therefore includes all of its limitations. Accordingly, Applicants 
respectfully submit that independent claim 1 and dependent claim 3 are allowable over the cited 
reference. 

B. 

Claims 1-9, 11, 14-16 and 30 were rejected under 35 U.S.C. § 102(e) as being anticipated 
by U.S. Patent No. 6,780,642 to Narayanan ("Narayanan"). Applicants respectfully submit that 
Narayanan is not enabled with respect to amended independent claims 1, 5 and 30. Narayanan 
describes methods of detecting cancer in a biological sample by detecting SIM2 nucleic acid or 
protein in the sample and methods for treating cancer and identifying compounds that modulate 
SIM2 expression. (Narayanan, Abstract) Among the sequences disclosed in Narayanan is SEQ. 
ID NO.: 3 which is the native form of the human transcription factor SIM2 gene and contains 667 
amino acid residues, (col. 39-43) SEQ. ID NO.:3 is simply a repeat of U80456 of GenBank. (see 
GenBank U80456 available at http://www.ncbi.nlm.nih.gov/Genbank/GenbankSearch.html) 
Narayanan is not enabled with respect to SEQ. ID NO.:l of independent claims 1, 5 and 30 
because Narayanan does not enable one skilled in the art to make and use the claimed invention. 
There is no indication that Narayanan specifically identifies SEQ. ID NO.:l or any of its newly 
discovered capabilities, as discovered by Applicants and claimed in the Application. Narayanan 
does not give any working examples of SEQ. ID NO.:l and is void in direction or guidance with 
respect to how to make and use SEQ. ID NO.:l. Moreover, if every newly discovered peptide, 
protein or nucleic acid sequence encoding a newly discovered peptide or protein derived from a 
known nucleic acid sequence precluded patentability, then no patent would issue for sequences, 
including the sequences disclosed in Narayanan. Accordingly, Applicants respectfully submit 
that independent claims 1, 5 and 30 are allowable over the cited reference. 

Applicants respectfully remind the Examiner that prior art rejections should ordinarily be 
confined strictly to the best available art. MPEP 706.02(1). The Examiner rejects the same claims 
(1-3) under both 35 U.S.C. § 102(a) and 102(e) citing different references. In subsequent any 
offices action, should one issue, Applicants respectfully request that the Examiner only cite the 
best reference in any rejection. 
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CONCLUSION 



In view of the foregoing, it is believed that all claims now pending patentably define the 
subject invention over the prior art of record and are in condition for allowance and such action is 
earnestly solicited at the earliest possible date. 

Respectfully submitted, 

BLAKELY, SOKOLOFF, TAYLOR & ZAFMAN LLP 



Date: ff//g/o6 



%U0 fay 



Shelley M. Cobos, Reg. No. 56,174 



12400 Wilshire Boulevard 
Seventh Floor 

Los Angeles, California 90025 
Telephone (310) 207-3800 
Facsimile (310) 820-5988 



CERTIFICATE OF MAILING 
I hereby certify that this correspondence is being deposited with the 
United States Postal Service on the date shown below with sufficient 
postage as first class mail in an envelope addressed to: Commissioner for 
Patents, P.O. Box 1450, Alexandria, VA 22313-1450. 




Melissa Stead 



Date 
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Shelley Cobos 



To: 



Cc: 



Sent: 



From: 



Steve De_Klerk 

Wednesday, September 13, 2006 7:22 PM 
Shelley Cobos 
Linda Brest 



Subject: FW: [Urgent] U.S. National Stage Patent Application 10/501 ,964 

Attachments: U80456_comparison_result.pdf; U80456 (July-8-1997).pdf; U80456 (May-2-1997).pdf 

Shelly, please see below and attached. There was also a fax that came in yesterday that we are sending 
on to you. 

From: NAM AND NAM [mailto:nampat@nampat.co.kr] 
Sent: Wed 9/13/2006 6:46 PM 
To: Steve DeJ<lerk 

Subject: Re: [Urgent] U.S. National Stage Patent Application 10/501,964 
Dear Mr. De Klerk : 

This is further to our letter of September 13, 2006. 

As mentioned in our previous letter, we sent an e-mail to the NCBI and received a reply from 
Mr. Hanguan Liu of the NCBI as follows: 

"Regarding the update history of u80456, the accession number, u80456, was first released into the 
GenBank public database on May 2, 1997 and was updated once on July 8, 1997. 

See the records below: 

http://www.ncbi.nlm.nih,gov/entrez/viewer.fcgi?2062416:NCBI:274571 
http://www.ncbi.nlm.nih.gOv/entrez/viewer.fcgi72062416:OLDID:3445559" 

We have compared the sequence released on May 2, 1997 with the sequence updated on July 8, 
1997. Consequently, we confirm that there is no difference between the two sequences. 

Enclosed herewith please find the information of the above sequences and the comparison result. 

Thank you for your continuous cooperation. 

Yours sincerely, 



Jason LEE 
Patent Attorney 




9/14/2006 



NAM & NAM World Patent & Law Firm. 
Kwanghwamoon P.O.Box 58 Seoul Korea 

Maekyung Media Center 9th Fl., 30, 1-Ga, Pil-Dong, Jung-Ku, Seoul, Korea 
Phone : 82 2 753-5477 Fax : 82 2 753-7315 
E-mail : nampat@nampat.co.kr 
Http://www.nampatxo.kr 

The information contained in this transmission is privileged and 
confidential information intended only for use of the individual or 
entity named above. If you are not the intended recipient, you are 
hereby notified that any dissemination, distribution or copying of this 
communication is strictly prohibited. If you have received this 
transmission in error, please immediately reply to the sender that you 
received this communication and delete it. 
Thank you. 
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LOCUS HSU80456 3921 bp mRNA linear PRI 02 - MAY 

4997 08-JUL-1997 

DEFINITION Human transcription factor SIM2 long form mRNA, complete cds. 

ACCESSION U80456 

VERSION U80456.1 Gl:2062416 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryotac; mitochondr i a l cukaryotos; Mctazoa; Chordato; 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

Vertebrata; Euthcr i a; Primates; Catarrhini; Hominidae; Homo. 

REFERENCE 1 (bases 1 to 3921) 

AUTHORS Chrast.R., Scott.H.S., Chen.H., Kudoh.J., Rossier.C., Chcn, H ., MinoGh i ma.S., 
Shimizu.N , 

Minoshima,S., 

Wang.Y., Shimizu.N. and Antonarakis.S.E. 

TITLE Cloning of S I M2, a human homologuo t wo human homologs of the Drosophila 
s i ng l e m i nd o d single-minded gene 

geno 

SIM1 on chromosome 6q and SIM2 on 21q within the Down syndrome 

JOURNAL Unpublish e d 

chromosomal region 
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JOURNAL Genome Res. 7 (6), 615-624 (1997) 



PUBMED 9199934 



REFERENCE 2 (bases 1 to 3921) 



AUTHORS Chrast.R., Kudoh.J., Rossier.C, Chen.H., Minoshima.S., Shimizu.N. 



and Antonarakis.S.E. 



TITLE Direct Submission 



JOURNAL Submitted (29-NOV-1996) Medical Genetics, University of Geneva 



Medical School, 1, Rue Michel-Servet, Geneva 1211, Switzerland 



FEATURES 



source 



CDS 



Location/Qualifiers 
1..3921 

/organism="Homo sapiens" 

/mol_type= n mRNA" 

/db_xref="taxon:9606" 

/chromosome-^r 

/map= M 21q22.2" 

93..2096 
/codon_start=1 

/product="transcription factor SIM2 long form" 

/protoin_„ ^ d^ ,, 2Q62 4 17 7 prote^nJd^ , AAB62396.r , 
/db_xref="GI:2062417" 



/translation="MKEKSKNAAKTRREKENGEFYELAKLLPLPSAITSQLDKASIIR 
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LTTSYLKMRAVFPEGLGDAWGQPSRAGPLDGVAKELGSHLLQTLDGFVFWASDGKIM 

YISETASVHLGLSQVELTGNSIYEYIHPSDHDEMTAVLTAHQPLHHHLLQEYEIERSF 

FLRMKCVLAKRNAGLTCSGYKVIHCSGYLKIRQYMLDMSLYDSCYQIVGLVAVGQSLP 

PSAITEIKLYSNMFMFRASLDLKLIFLDSRVTEVTGYEPQDLIEKTLYHHVHGCDVFH 

LRYAHHLLLVKGQVTTKYYRLLSKRGGWVWVQSYATWHNSRSSRPHCIVSVNYVLTE 

1EYKELQLSLEQVSTAKSQDSWRTALSTSQETRKLVKPKNTKMKTKLRTNPYPPQQYS 

SFQMDKLECGQLGNWRASPPASAAAPPELQPHSESSDLLYTPSYSLPFSYHYGHFPLD 

SHVFSSKKPMLPAKFGQPQGSPCEVARFFLSTLPASGECQWHYANPLVPSSSSPAKNP 

PEPPANTARHSLVPSYEAPAAAVRRFGEDTAPPSFPSCGHYREEPALGPAKAARQAAR 

DGARLALARAAPECCAPPTPEAPGAPAQLPFVLLNYHRVLARRGPLGGAAPAASGLAC 
APGGPEAATGALRLRHPSPAATSPPGAPLPHYLGASVIITNGR" 

ORIGIN 

1 actcactata gggctcgagc ggccgcccgg gcaggtgggg ctccgcgggc ctggagcacg 
61 gccgggtcta atatgcccgg agccgaggcg cgatgaagga gaagtccaag aatgcggcca 
121 agaccaggag ggagaaggaa aatggcgagt tttacgagct tgccaagctg ctcccgctgc 
181 cgtcggccat cacttcgcag ctggacaaag cgtccatcat ccgcctcacc acgagctacc 
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241 tgaagatgcg cgccgtcttc cccgaaggtt taggagacgc gtggggacag ccgagccgcg 
301 ccgggcccct ggacggcgtc gccaaggagc tgggatcgca cttgctgcag actttggatg 
361 gatttgtttt tgtggtagca tctgatggca aaatcatgta tatatccgag accgcttctg 
421 tccatttagg cttatcccag gtggagctca cgggcaacag tatttatgaa tacatccatc 
481 cttctgacca cgatgagatg accgctgtcc tcacggccca ccagccgctg caccaccacc 
541 tgctccaaga gtatgagata gagaggtcgt tctttcttcg aatgaaatgt gtcttggcga 
601 aaaggaacgc gggcctgacc tgcagcggat acaaggtcat ccactgcagt ggctacttga 
661 agatcaggca gtatatgctg gacatgtccc tgtacgactc ctgctaccag attgtggggc 
721 tggtggccgt gggccagtcg ctgccaccca gtgccatcac cgagatcaag ctgtacagta 
781 acatgttcat gttcagggcc agccttgacc tgaagctgat attcctggat tccagggtga 
841 ccgaggtgac gggttacgag ccgcaggacc tgatcgagaa gaccctatac catcacgtgc 
901 acggctgcga cgtgttccac ctccgctacg cacaccacct cctgttggtg aagggccagg 
961 tcaccaccaa gtactaccgg ctgctgtcca agcggggcgg ctgggtgtgg gtgcagagct 
1021 acgccaccgt ggtgcacaac agccgctcgt cccggcccca ctgcatcgtg agtgtcaatt 
1081 atgtactcac ggagattgaa tacaaggaac ttcagctgtc cctggagcag gtgtccactg 
1 141 ccaagtccca ggactcctgg aggaccgcct tgtctacctc acaagaaact aggaaattag 
1201 tgaaacccaa aaataccaag atgaagacaa agctgagaac aaacccttac cccccacagc 
1261 aatacagctc gttccaaatg gacaaactgg aatgcggcca gctcggaaac tggagagcca 
1321 gtccccctgc aagcgctgct gctcctccag aactgcagcc ccactcagaa agcagtgacc 
1381 ttctgtacac gccatcctac agcctgccct tctcctacca ttacggacac ttccctctgg 
1441 actctcacgt cttcagcagc aaaaagccaa tgttgccggc caagttcggg cagccccaag 
1501 gatccccttg tgaggtggca cgctttttcc tgagcacact gccagccagc ggtgaatgcc 
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1561 agtggcatta tgccaacccc ctagtgccta gcagctcgtc tccagctaaa aatcctccag 
1621 agccaccggc gaacactgct aggcacagcc tggtgccaag ctacgaagcg cccgccgccg 
1681 ccgtgcgcag gttcggcgag gacaccgcgc ccccgagctt cccgagctgc ggccactacc 
1741 gcgaggagcc cgcgctgggc ccggccaaag ccgcccgcca ggccgcccgg gacggggcgc 
1801 ggctggcgct ggcccgcgcg gcacccgagt gctgcgcgcc cccgaccccc gaggccccgg 
1861 gcgcgccggc gcagctgccc ttcgtgctgc tcaactacca ccgcgtgctg gcccggcgcg 
1921 gaccgctggg gggcgccgca cccgccgcct ccggcctggc ctgcgctccc ggcggccccg 
1981 aggcggcgac cggcgcgctg cggctccggc acccgagccc cgccgccacc tccccgcccg 
2041 gcgcgcccct gccgcactac ctgggcgcct cggtcatcat caccaacggg aggtgacccg 
2101 ctggccgccc gcgccaggag cctggacccg gcctcccggg gctgcggcgc caccgagccc 
2161 ggcaaatgcg cacgacctac attaatttat gcagagacag ctgtttgaat tggaccccgc 
2221 cgccgacttg cggatttcca ccgcggaggc cccgcgcgcc ggtgccgagg gccgaggagc 
2281 gcccgggtcc gggcaggtga ccgcccgcct ctgtcctgcg agggccggtg cgacccagtt 
2341 gctgggggct tggtttcctc accttgaaat cgggcttcac gcgtcttgcc ttgtccccaa 
2401 cgttccacaa cagtcccgct gggggattga agcggtttca ctccgcaaat atcctccact 
2461 ttcaggaggg aaaacccacc ctaccacagt ccgctcttcc aagtggacgg cagacctggg 
2521 aggggacgcc tgtgtcacga gcccttttag atgcttaggt gaaggcagaa gtgatgattg 
2581 taagtcccat gaatacacaa ctccactgtc tttaaaagtc attcaagagt ctcattattt 
2641 ttgtttttat ttaacccttt cttcaataca aaaagccaac aaaccaagac taagggggtg 
2701 accatgcaat tccattttgt gtctgtgaac ataggtgtgc ttcccaaata cattaacaag 
2761 ctcttacttc cccctaaccc ctatgaactc ttgataacac caagagtagc accttcagaa 
2821 tatattgaat aggcattaaa tgcaaaaata tatatgtagc cagacagttt atgagaatga 
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2881 ccctgtcaag cttcattatt acgtggcaaa atccctctgg cccacacaga tctgtaattc 
2941 actaggctcg tgtttgctac aaatagtgct aataaagtta aattgcacgt gcaatacgga 
3001 acactgtcaa tggactgcac cttgtgaagg aaaaacatgc ttaagggggt gtaatgaaaa 
3061 tgatgtagac attttaagca ttttctacac agcgagaaaa cttcgtaaga acatgttacg 
3121 tgtgcaacag gtaaacagaa atcctttcat aaagcaccag cagtgtttaa aaaatgagct 
3181 tccattaatt tttacttttt atgggttttg cttaaagatc tcaacatgga aaaatcctgt 
3241 catggctctg aactgcacaa tgcattgaac cgccgtcctt caattttctt cacactatca 
3301 acactgcagc attttgctgc tttatcaaaa tggtttattt taggaaactt tttccacctt 
3361 tctgaatgga aagaggtttt cacaaatgtt ttaaactcat cgttctaaaa tcaagtgcac 
3421 ctacaccaac tgctctcaaa atgtgaactg actttttttt tttttttttt gccaaccctg 
3481 tgtcacttag tgaggacctg acacaatccc tacagggtgt ctgtcagtgg gcctcatggt 
3541 aagagtcaca atttgcaaat ttaggaccgt gggtcatgca gcgaaggggc tggatggtag 
3601 gaagggatgt gcccgcctct ccacgcactc agctatacct cattcacagc tccttgtgag 
3661 tgtgtgcaca ggaaataagc cgagggtatt atttttttat gttcatgagt cttgtaatta 
3721 aaccgtgatt cttgaaaggt gtaggtttga ttactaggag ataccaccga catttttcaa 
3781 taaagtactg caaaatgctt ttgtgtctac cttgttatta acttttgggg ctgtatttag 
3841 taaaaataaa tcaaggctat cggagcagtt caataacaaa ggttactgtt gagaaaaaag 
3901 accctatcat agatttacaa g 
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□ l: U80456. Reports Human transcripti...[gi:2062416] 
features Sequence 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



CDS 



HSU80456 3921 bp mRNA linear PRI 08-JUL-1997 

Human transcription factor SIM2 long form mRNA, complete cds . 
U80456 

U80456.1 GI:2062416 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (bases 1 to 3921) 

Chrast,R., Scott, H.S., Chen,H., Kudoh,J., Rossier,C, Minoshima, S . , 
Wang,Y., Shimizu,N. and Antonarakis, S . E . 

Cloning of two human homologs of the Drosophila single-minded gene 

SIM1 on chromosome 6q and SIM2 on 21q within the Down syndrome 

chromosomal region 

Genome Res. 7 (6), 615-624 (1997) 

9199934 

2 (bases 1 to 3921) 

Chrast,R., Kudoh,J., Rossier,C, Chen,H., Minoshima, S . , Shimizu,N. 
and Antonarakis, S . E. 
Direct Submission 

Submitted (29-NOV-1996) Medical Genetics, University of Geneva 
Medical School, 1, Rue Michel-Servet, Geneva 1211, Switzerland 

Location/Qualif iers 

1 . .3921 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/chromosome="21" 

/map="21q22.2" 

93. .2096 

/codon_start=l 

/product="transcription factor SIM2 long form" 
/protein_id="AAB62396. 1" 
/db_xref="GI:2062417" 

/ translation="MKEKSKNAAKTRREKENGEFYELAKLLPLPSAITSQLDKASI IR 
LTTSYLKMRAVFPEGLGDAWGQPSRAGPLDGVAKELGSHLLQTLDGFVFWASDGKIM 
Y I SETASVHLGLSQVELTGNS I YEY I HPSDHDEMTAVLTAHQPLHHHLLQE YE IERSF 
FLRMKCVLAKRNAGLTCSGYKVIHCSGYLKIRQYMLDMSLYDSCYQIVGLVAVGQSLP 
PSAITEIKLYSNMFMFRASLDLKLIFLDSRVTEVTGYEPQDLIEKTLYHHVHGCDVFH 
LRYAHHLLLVKGQVTTKYYRLLSKRGGWVWVQSYATVVHNSRSSRPHCIVSVNYVLTE 
IEYKELQLSLEQVSTAKSQDSWRTALSTSQETRKLVKPKNTKMKTKLRTNPYPPQQYS 
SFQMDKLECGQLGNWRASPPASAAAPPELQPHSESSDLLYTPSYSLPFSYHYGHFPLD 
SHVFSSKKPMLPAKFGQPQGSPCEVARFFLSTLPASGECQWHYANPLVPSSSSPAKNP 
PEPPANTARHSLVPSYEAPAAAVRRFGEDTAPPSFPSCGHYREEPALGPAKAARQAAR 
DGARLALARAAPECCAPPTPEAPGAPAQLPFVLLNYHRVLARRGPLGGAAPAASGLAC 
APGGPEAATGALRLRHPSPAATSPPGAPLPHYLGASVIITNGR" 



ORIGIN 



1 actcactata gggctcgagc ggccgcccgg gcaggtgggg ctccgcgggc ctggagcacg 
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61 gccgggtcta atatgcccgg agccgaggcg cgatgaagga gaagtccaag aatgcggcca 
121 agaccaggag ggagaaggaa aatggcgagt tttacgagct tgccaagctg ctcccgctgc 
181 cgtcggccat cacttcgcag ctggacaaag cgtccatcat ccgcctcacc acgagctacc 
241 tgaagatgcg cgccgtcttc cccgaaggtt taggagacgc gtggggacag ccgagccgcg 
301 ccgggcccct ggacggcgtc gccaaggagc tgggatcgca cttgctgcag actttggatg 
361 gatttgtttt tgtggtagca tctgatggca aaatcatgta tatatccgag accgcttctg 
421 tccatttagg cttatcccag gtggagctca cgggcaacag tatttatgaa tacatccatc 
481 cttctgacca cgatgagatg accgctgtcc tcacggccca ccagccgctg caccaccacc 
541 tgctccaaga gtatgagata gagaggtcgt tctttcttcg aatgaaatgt gtcttggcga 
601 aaaggaacgc gggcctgacc tgcagcggat acaaggtcat ccactgcagt ggctacttga 
661 agatcaggca gtatatgctg gacatgtccc tgtacgactc ctgctaccag attgtggggc 
721 tggtggccgt gggccagtcg ctgccaccca gtgccatcac cgagatcaag ctgtacagta 
781 acatgttcat gttcagggcc agccttgacc tgaagctgat attcctggat tccagggtga 
841 ccgaggtgac gggttacgag ccgcaggacc tgatcgagaa gaccctatac catcacgtgc 
901 acggctgcga cgtgttccac ctccgctacg cacaccacct cctgttggtg aagggccagg 
961 tcaccaccaa gtactaccgg ctgctgtcca agcggggcgg ctgggtgtgg gtgcagagct 
1021 acgccaccgt ggtgcacaac agccgctcgt cccggcccca ctgcatcgtg agtgtcaatt 
10 81 atgtactcac ggagattgaa tacaaggaac ttcagctgtc cctggagcag gtgtccactg 
1141 ccaagtccca ggactcctgg aggaccgcct tgtctacctc acaagaaact aggaaattag 
1201 tgaaacccaa aaataccaag atgaagacaa agctgagaac aaacccttac cccccacagc 
1261 aatacagctc gttccaaatg gacaaactgg. aatgcggcca gctcggaaac tggagagcca 
1321 gtccccctgc aagcgctgct gctcctccag aactgcagcc ccactcagaa agcagtgacc 
1381 ttctgtacac gccatcctac agcctgccct tctcctacca ttacggacac ttccctctgg 
1441 actctcacgt cttcagcagc aaaaagccaa tgttgccggc caagttcggg cagccccaag 
1501 gatccccttg tgaggtggca cgctttttcc tgagcacact gccagccagc ggtgaatgcc 
1561 agtggcatta tgccaacccc ctagtgccta gcagctcgtc tccagctaaa aatcctccag 
1621 agccaccggc gaacactgct aggcacagcc tggtgccaag ctacgaagcg cccgccgccg 
1681 ccgtgcgcag gttcggcgag gacaccgcgc ccccgagctt cccgagctgc ggccactacc 
1741 gcgaggagcc cgcgctgggc ccggccaaag ccgcccgcca ggccgcccgg gacggggcgc 
1801 ggctggcgct ggcccgcgcg gcacccgagt gctgcgcgcc cccgaccccc gaggccccgg 
1861 gcgcgccggc gcagctgccc ttcgtgctgc tcaactacca ccgcgtgctg gcccggcgcg 
1921 gaccgctggg gggcgccgca cccgccgcct ccggcctggc ctgcgctccc ggcggccccg 
1981 aggcggcgac cggcgcgctg cggctccggc acccgagccc^ cgccgccacc tccccgcccg 
2041 gcgcgcccct gccgcactac ctgggcgcct cggtcatcat caccaacggg aggtgacccg 
2101 ctggccgccc gcgccaggag cctggacccg gcctcccggg gctgcggcgc caccgagccc 
2161 ggcaaatgcg cacgacctac attaatttat gcagagacag ctgtttgaat tggaccccgc 
2221 cgccgacttg cggatttcca ccgcggaggc cccgcgcgcc ggtgccgagg gccgaggagc 
2281 gcccgggtcc gggcaggtga ccgcccgcct ctgtcctgcg agggccggtg cgacccagtt 
2341 gctgggggct tggtttcctc accttgaaat cgggcttcac gcgtcttgcc ttgtccccaa 
2401 cgttccacaa cagtcccgct gggggattga agcggtttca ctccgcaaat atcctccact 
2461 ttcaggaggg aaaacccacc ctaccacagt ccgctcttcc aagtggacgg cagacctggg 
2521 aggggacgcc tgtgtcacga gcccttttag atgcttaggt gaaggcagaa gtgatgattg 
2581 taagtcccat gaatacacaa ctccactgtc tttaaaagtc attcaagagt ctcattattt 
2641 ttgtttttat ttaacccttt cttcaataca aaaagccaac aaaccaagac taagggggtg 
2701 accatgcaat tccattttgt gtctgtgaac ataggtgtgc ttcccaaata cattaacaag 
2761 ctcttacttc cccctaaccc ctatgaactc ttgataacac caagagtagc accttcagaa 
2821 tatattgaat aggcattaaa tgcaaaaata tatatgtagc cagacagttt atgagaatga 
2881 ccctgtcaag cttcattatt acgtggcaaa atccctctgg cccacacaga tctgtaattc 
2941 actaggctcg tgtttgctac aaatagtgct aataaagtta aattgcacgt gcaatacgga 
3001 acactgtcaa tggactgcac cttgtgaagg aaaaacatgc ttaagggggt gtaatgaaaa 
3061 tgatgtagac attttaagca ttttctacac agcgagaaaa cttcgtaaga acatgttacg 
3121 tgtgcaacag gtaaacagaa atcctttcat aaagcaccag cagtgtttaa aaaatgagct 
3181 tccattaatt tttacttttt atgggttttg cttaaagatc tcaacatgga aaaatcctgt 
3241 catggctctg aactgcacaa tgcattgaac cgccgtcctt caattttctt cacactatca 
3301 acactgcagc attttgctgc tttatcaaaa tggtttattt taggaaactt tttccacctt 
3361 tctgaatgga aagaggtttt cacaaatgtt ttaaactcat cgttctaaaa tcaagtgcac 
3421 ctacaccaac tgctctcaaa atgtgaactg actttttttt tttttttttt gccaaccctg 
3481 tgtcacttag tgaggacctg acacaatccc tacagggtgt ctgtcagtgg gcctcatggt 
3541 aagagtcaca atttgcaaat ttaggaccgt gggtcatgca gcgaaggggc tggatggtag 
3601 gaagggatgt gcccgcctct ccacgcactc agctatacct cattcacagc tccttgtgag 
3661 tgtgtgcaca ggaaataagc cgagggtatt atttttttat gttcatgagt cttgtaatta 
3721 aaccgtgatt cttgaaaggt gtaggtttga ttactaggag ataccaccga catttttcaa 
3781 taaagtactg caaaatgctt ttgtgtctac cttgttatta acttttgggg ctgtatttag 
3841 taaaaataaa tcaaggctat cggagcagtt caataacaaa ggttactgtt gagaaaaaag 
3901 accctatcat agatttacaa 
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□ l: U80456. Reports Human transcriptL.[gi:20624 16] 
Features Sequence 
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HSU80456 3921 bp mRNA linear PRI 02-MAY-1997 

Human transcription factor SIM2 long form mRNA, complete cds. 
U80456 

U80456 GI:2062416 



(human) 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



Homo sapiens 
Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 3921) 

Chrast,R., Kudoh,J., Rossier,C, Chen,H., Minoshima, S . , Shimizu,N. 
and Antonarakis, S . E . 

Cloning of SIM2, a human homologue of the Drosophila single minded 
gene 

Unpublished 

2 (bases 1 to 3921) 

Chrast,R., Kudoh,J., Rossier,C, Chen,H., Minoshima, S . , Shimizu,N. 
and Antonarakis, S . E . 
Direct Submission 

Submitted (29-NOV-1996) Medical Genetics, University of Geneva 
Medical School, 1, Rue Michel-Servet, Geneva 1211, Switzerland 
Location/Qualifiers 
1. .3921 

/organism= M Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/chromosome="21" 
/map="21q22.2" 
93. .2096 
/codon_start=l 

/product="transcription factor SIM2 long form" 
/protein_id="2062417" 
/db_xref="GI: 2062417" 

/translation="MKEKSKNAAKTRREKENGEFYELAKLLPLPSAITSQLDKASIIR 
LTTSYLKMRAVFPEGLGDAWGQPSRAGPLDGVAKELGSHLLQTLDGFVFWASDGKIM 
YISETASVHLGLSQVELTGNSI YEYIHPSDHDEMTAVLTAHQPLHHHLLQEYEIERSF 
FLRMKCVLAKRNAGLTCSGYKVIHCSGYLKIRQYMLDMSLYDSCYQIVGLVAVGQSLP 
PSAITEIKLYSNMFMFRASLDLKLIFLDSRVTEVTGYEPQDLIEKTLYHHVHGCDVFH 
LRYAHHLLLVKGQVTTKYYRLLSKRGGWVWVQSYATVVHNSRSSRPHCIVSVNYVLTE 
IEYKELQLSLEQVSTAKSQDSWRTALSTSQETRKLVKPKNTKMKTKLRTNPYPPQQYS 
SFQMDKLECGQLGNWRASPPASAAAPPELQPHSESSDLLYTPSYSLPFSYHYGHFPLD 
SHVFSSKKPMLPAKFGQPQGSPCEVARFFLSTLPASGECQWHYANPLVPSSSSPAKNP 
PEPPANTARHSLVPSYEAPAAAVRRFGEDTAPPSFPSCGHYREEPALGPAKAARQAAR 
DGARLALARAAPECCAPPTPEAPGAPAQLPFVLLNYHRVLARRGPLGGAAPAASGLAC 
APGGPEAATGALRLRHPSPAATSPPGAPLPHYLGASVI ITNGR" 

1 actcactata gggctcgagc ggccgcccgg gcaggtgggg ctccgcgggc ctggagcacg Exhibit, 

61 geegggtcta atatgecegg agecgaggeg cgatgaagga gaagtccaag aatgeggeca 

121 agaccaggag ggagaaggaa aatggcgagt tttacgagct tgccaagctg ctcccgctgc 

181 cgtcggccat cacttcgcag ctggacaaag cgtccatcat ccgcctcacc acgagctacc 
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241 tgaagatgcg cgccgtcttc cccgaaggtt taggagacgc gtggggacag ccgagccgcg 

301 ccgggcccct ggacggcgtc gccaaggagc tgggatcgca cttgctgcag actttggatg 

361 gatttgtttt tgtggtagca tctgatggca aaatcatgta tatatccgag accgcttctg 

421 tccatttagg cttatcccag gtggagctca cgggcaacag tatttatgaa tacatccatc 

481 cttctgacca cgatgagatg accgctgtcc tcacggccca ccagccgctg caccaccacc 

541 tgctccaaga gtatgagata gagaggtcgt tctttcttcg aatgaaatgt gtcttggcga 

601 aaaggaacgc gggcctgacc tgcagcggat acaaggtcat ccactgcagt ggctacttga 

661 agatcaggca gtatatgctg gacatgtccc tgtacgactc ctgctaccag attgtggggc 

721 tggtggccgt gggccagtcg ctgccaccca gtgccatcac cgagatcaag ctgtacagta 

781 acatgttcat gttcagggcc agccttgacc tgaagctgat attcctggat tccagggtga 

841 ccgaggtgac gggttacgag ccgcaggacc tgatcgagaa gaccctatac catcacgtgc 

901 acggctgcga cgtgttccac ctccgctacg cacaccacct cctgttggtg aagggccagg 

961 tcaccaccaa gtactaccgg ctgctgtcca agcggggcgg ctgggtgtgg gtgcagagct 

1021 acgccaccgt ggtgcacaac agccgctcgt cccggcccca ctgcatcgtg agtgtcaatt 

1081 atgtactcac ggagattgaa tacaaggaac ttcagctgtc cctggagcag gtgtccactg 

1141 ccaagtccca ggactcctgg aggaccgcct tgtctacctc acaagaaact aggaaattag 

1201 tgaaacccaa aaataccaag atgaagacaa agctgagaac aaacccttac cccccacagc 

1261 aatacagctc gttccaaatg gacaaactgg aatgcggcca gctcggaaac tggagagcca 

1321 gtccccctgc aagcgctgct gctcctccag aactgcagcc ccactcagaa agcagtgacc 

1381 ttctgtacac gccatcctac agcctgccct tctcctacca ttacggacac ttccctctgg 

1441 actctcacgt cttcagcagc aaaaagccaa tgttgccggc caagttcggg cagccccaag 

1501 gatccccttg tgaggtggca cgctttttcc tgagcacact gccagccagc ggtgaatgcc 

1561 agtggcatta tgccaacccc ctagtgccta gcagctcgtc tccagctaaa aatcctccag 

1621 agccaccggc gaacactgct aggcacagcc tggtgccaag ctacgaagcg cccgccgccg 

1681 ccgtgcgcag gttcggcgag gacaccgcgc ccccgagctt cccgagctgc ggccactacc 

1741 gcgaggagcc cgcgctgggc ccggccaaag ccgcccgcca ggccgcccgg gacggggcgc 

1801 ggctggcgct ggcccgcgcg gcacccgagt gctgcgcgcc cccgaccccc gaggccccgg 

1861 gcgcgccggc gcagctgccc ttcgtgctgc tcaactacca ccgcgtgctg gcccggcgcg 

1921 gaccgctggg gggcgccgca cccgccgcct ccggcctggc ctgcgctccc ggcggccccg 

1981 aggcggcgac cggcgcgctg cggctccggc acccgagccc cgccgccacc tccccgcccg 

2041 gcgcgcccct gccgcactac ctgggcgcct cggtcatcat caccaacggg aggtgacccg 

2101 ctggccgccc gcgccaggag cctggacccg gcctcccggg gctgcggcgc caccgagccc 

2161 ggcaaatgcg cacgacctac attaatttat gcagagacag ctgtttgaat tggaccccgc 

2221 cgccgacttg cggatttcca ccgcggaggc cccgcgcgcc ggtgccgagg gccgaggagc 

2281 gcccgggtcc gggcaggtga ccgcccgcct ctgtcctgcg agggccggtg cgacccagtt 

2341 gctgggggct tggtttcctc accttgaaat cgggcttcac gcgtcttgcc ttgtccccaa 

2401 cgttccacaa cagtcccgct gggggattga agcggtttca ctccgcaaat atcctccact 

2461 ttcaggaggg aaaacccacc ctaccacagt ccgctcttcc aagtggacgg cagacctggg 

2521 aggggacgcc tgtgtcacga gcccttttag atgcttaggt gaaggcagaa gtgatgattg 

2581 taagtcccat gaatacacaa ctccactgtc tttaaaagtc attcaagagt ctcattattt 

2641 ttgtttttat ttaacccttt cttcaataca aaaagccaac aaaccaagac taagggggtg 

2701 accatgcaat tccattttgt gtctgtgaac ataggtgtgc ttcccaaata cattaacaag 

2761 ctcttacttc cccctaaccc ctatgaactc ttgataacac caagagtagc accttcagaa 

2821 tatattgaat aggcattaaa tgcaaaaata tatatgtagc cagacagttt atgagaatga 

2881 ccctgtcaag cttcattatt acgtggcaaa atccctctgg cccacacaga tctgtaattc 

2941 actaggctcg tgtttgctac aaatagtgct aataaagtta aattgcacgt gcaatacgga 

3001 acactgtcaa tggactgcac cttgtgaagg aaaaacatgc ttaagggggt gtaatgaaaa 

3061 tgatgtagac attttaagca ttttctacac agcgagaaaa cttcgtaaga acatgttacg 

3121 tgtgcaacag gtaaacagaa atcctttcat aaagcaccag cagtgtttaa aaaatgagct 

3181 tccattaatt tttacttttt atgggttttg cttaaagatc tcaacatgga aaaatcctgt 

3241 catggctctg aactgcacaa tgcattgaac cgccgtcctt caattttctt cacactatca 

3301 acactgcagc attttgctgc tttatcaaaa tggtttattt taggaaactt tttccacctt 

3361 tctgaatgga aagaggtttt cacaaatgtt ttaaactcat cgttctaaaa tcaagtgcac 

3421 ctacaccaac tgctctcaaa atgtgaactg actttttttt tttttttttt gccaaccctg 

3481 tgtcacttag tgaggacctg acacaatccc tacagggtgt ctgtcagtgg gcctcatggt 

3541 aagagtcaca atttgcaaat ttaggaccgt gggtcatgca gcgaaggggc tggatggtag 

3601 gaagggatgt gcccgcctct ccacgcactc agctatacct cattcacagc tccttgtgag 

3661 tgtgtgcaca ggaaataagc cgagggtatt atttttttat gttcatgagt cttgtaatta 

3721 aaccgtgatt cttgaaaggt gtaggtttga ttactaggag ataccaccga catttttcaa 

3781 taaagtactg caaaatgctt ttgtgtctac cttgttatta acttttgggg ctgtatttag 

3841 taaaaataaa tcaaggctat cggagcagtt caataacaaa ggttactgtt gagaaaaaag 
3901 accctatcat agatttacaa g 
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